Smart Paper Notes

Ab-initio quantum chemistry with neural-network wavefunctions

Jan Hermann, James Spencer, Kenny Choo, Antonio Mezzacapo, W. M. C. Foulkes, David Pfau, Giuseppe Carleo, Frank Noé

Category: Machine Learning | Machine Learning (Statistics)

2022-08-26

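Machine learning, and deep learning in particular, has outperformed human capabilities in many pattern recognition and data processing problems and in game playing, and it now plays an increasingly important role in scientific discovery. A key application of machine learning in the molecular sciences is to learn potential energy surfaces or force fields from ab-initio solutions of the electronic Schrödinger equation obtained with density functional theory, coupled cluster, or other quantum chemistry methods. We review a recent and complementary approach: using machine learning to aid the direct solution of quantum chemistry problems from first principles. Specifically, we focus on quantum Monte Carlo (QMC) methods that use neural-network ansatz functions to solve the electronic Schrödinger equation in both first and second quantization, to compute ground and excited states, and to generalize over multiple nuclear configurations. Compared with existing quantum chemistry methods, these new deep QMC methods have the potential to generate highly accurate solutions of the Schrödinger equation at relatively modest computational cost.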

Translated by Google Translate

Introduction to Machine Learning for the Sciences

Titus Neupert, Mark H Fischer, Eliska Greplova, Kenny Choo, M. Michael Denner

Category: Machine Learning

2021-02-08

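This is an introductory machine learning course developed specifically for STEM students. Our goal is to give the interested reader the basics needed to use machine learning in their own projects and to become familiar with the terminology as a foundation for further reading of the relevant literature. In these lecture notes, we discuss supervised, unsupervised, and reinforcement learning. The notes start with an exposition of machine learning methods without neural networks, such as principal component analysis, t-SNE, clustering, and linear regression and linear classifiers. We continue with an introduction to both basic and advanced neural-network structures, such as dense feed-forward and conventional neural networks, recurrent neural networks, restricted Boltzmann machines, (variational) autoencoders, and generative adversarial networks. Questions of interpretability are discussed for latent-space representations, using the examples of dreaming and adversarial attacks. The final section is dedicated to reinforcement learning, where we introduce the basic notions of value functions and policy learning.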

ReFu: Refine and Fuse the Unobserved View for Detail-Preserving Single-Image 3D Human Reconstruction

Gyumin Shim, Minsoo Lee, Jaegul Choo

Category: Computer Vision

2022-11-09

Single-image 3D human reconstruction aims to reconstruct the 3D textured surface of the human body given a single image. While implicit function-based methods recently achieved reasonable reconstruction performance, they still bear limitations showing degraded quality in both surface geometry and texture from an unobserved view. In response, to generate a realistic textured surface, we propose ReFu, a coarse-to-fine approach that refines the projected backside view image and fuses the refined image to predict the final human body. To suppress the diffused occupancy that causes noise in projection images and reconstructed meshes, we propose to train occupancy probability by simultaneously utilizing 2D and 3D supervisions with occupancy-based volume rendering. We also introduce a refinement architecture that generates detail-preserving backside-view images with front-to-back warping. Extensive experiments demonstrate that our method achieves state-of-the-art performance in 3D human reconstruction from a single image, showing enhanced geometry and texture quality from an unobserved view.

The Benefits of Model-Based Generalization in Reinforcement Learning

Kenny Young, Aditya Ramesh, Louis Kirsch, Jürgen Schmidhuber

Category: Machine Learning

2022-11-04

Model-Based Reinforcement Learning (RL) is widely believed to have the potential to improve sample efficiency by allowing an agent to synthesize large amounts of imagined experience. Experience Replay (ER) can be considered a simple kind of model, which has proved extremely effective at improving the stability and efficiency of deep RL. In principle, a learned parametric model could improve on ER by generalizing from real experience to augment the dataset with additional plausible experience. However, owing to the many design choices involved in empirically successful algorithms, it can be very hard to establish where the benefits are actually coming from. Here, we provide theoretical and empirical insight into when, and how, we can expect data generated by a learned model to be useful. First, we provide a general theorem motivating how learning a model as an intermediate step can narrow down the set of possible value functions more than learning a value function directly from data using the Bellman equation. Second, we provide an illustrative example showing empirically how a similar effect occurs in a more concrete setting with neural network function approximation. Finally, we provide extensive experiments showing the benefit of model-based learning for online RL in environments with combinatorial complexity, but factored structure that allows a learned model to generalize. In these experiments, we take care to control for other factors in order to isolate, insofar as possible, the benefit of using experience generated by a learned model relative to ER alone.

cRedAnno+: Annotation Exploitation in Self-Explanatory Lung Nodule Diagnosis

Jiahao Lu, Chong Yin, Kenny Erleben, Michael Bachmann Nielsen, Sune Darkner

Category: Computer Vision

2022-10-28

Recently, attempts have been made to reduce annotation requirements in feature-based self-explanatory models for lung nodule diagnosis. As a representative example, cRedAnno achieves competitive performance with considerably reduced annotation needs by introducing self-supervised contrastive learning for unsupervised feature extraction. However, it exhibits unstable performance under scarce annotation conditions. To improve the accuracy and robustness of cRedAnno, we propose an annotation exploitation mechanism that conducts semi-supervised active learning with sparse seeding and training quenching in the learned, semantically meaningful reasoning space to jointly utilise the extracted features, annotations, and unlabelled data. The proposed approach achieves comparable or even higher malignancy prediction accuracy with 10x fewer annotations, while also showing better robustness and nodule attribute prediction accuracy when only 1% of annotations are available. Our complete code is available as open source: https://github.com/diku-dk/credanno.

PePe: Personalized Post-editing Model utilizing User-generated Post-edits

Jihyeon Lee, Taehee Kim, Yunwon Tae, Cheonbok Park, Jaegul Choo

Category: Natural Language Processing | Artificial Intelligence

2022-09-21

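Incorporating personal preference is crucial in advanced machine translation tasks. Despite recent advances in machine translation, properly reflecting personal style remains a demanding task. In this paper, we introduce a personalized automatic post-editing (APE) framework to address this challenge, which effectively generates sentences that take distinct personal behaviors into account. To build this framework, we first collect post-editing data that reflects user preferences from a live machine translation system. Specifically, real-world users enter source sentences for translation and edit the machine-translated outputs according to their preferred style. We then propose a model that combines a discriminator module and user-specific parameters on top of the APE framework. Experimental results show that the proposed method outperforms other baseline models on four different metrics (i.e., BLEU, TER, YiSi-1, and human evaluation).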

Morphology-Aware Interactive Keypoint Estimation

Jinhee Kim, Taesung Kim, Taewoo Kim, Jaegul Choo, Dong-Wook Kim, Byungduk Ahn, In-Seok Song, Yoon-Ji Kim

Category: Computer Vision | Artificial Intelligence

2022-09-15

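Diagnosis based on medical images, such as X-ray images, often involves manual annotation of anatomical keypoints. However, this process requires significant human effort and can therefore become a bottleneck in the diagnostic process. To fully automate this procedure, deep-learning-based methods have been widely proposed and have achieved high performance in detecting keypoints in medical images. However, these methods still have clinical limitations: accuracy cannot be guaranteed for all cases, and doctors must double-check all of the models' predictions. In response, we propose a novel deep neural network that, given an X-ray image, automatically detects and refines the anatomical keypoints through a user-interactive system in which doctors can fix mispredicted keypoints with fewer clicks than manual revision requires. Using our own collected data and the publicly available AASCE dataset, we demonstrate the effectiveness of the proposed method in reducing annotation costs through extensive quantitative and qualitative results. A demo video of our approach is available on our project webpage.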

DASH: Visual Analytics for Debiasing Image Classification via User-Driven Synthetic Data Augmentation

Bum Chul Kwon, Jungsoo Lee, Chaeyeon Chung, Nyoungwoo Lee, Ho-Jin Choi, Jaegul Choo

Category: Computer Vision

2022-09-14

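Image classification models often learn to predict a class based on irrelevant co-occurrences between input features and an output class in the training data. We call these unwanted correlations "data biases," and the visual features that cause them "bias factors." Identifying and mitigating biases automatically, without human intervention, is challenging. We therefore conducted a design study to find a human-in-the-loop solution. First, working with three experts, we identified user tasks that capture the bias mitigation process for image classification models. Then, to support these tasks, we developed a visual analytics system called DASH that allows users to visually identify bias factors, to iteratively generate synthetic images using a state-of-the-art image-to-image translation model, and to supervise the model training process in order to improve classification accuracy. Our quantitative evaluation and qualitative study with ten participants demonstrate the usefulness of DASH and provide lessons for future work.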

Residual Correction in Real-Time Traffic Forecasting

Daejin Kim, Youngin Cho, Dongmin Kim, Cheonbok Park, Jaegul Choo

Category: Machine Learning

2022-09-12

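Predicting traffic conditions is highly challenging because every road depends strongly on the others, both spatially and temporally. Recently, specially designed architectures such as graph convolutional networks and temporal convolutional networks have been introduced to capture this spatial and temporal dependency. Although traffic forecasting has made remarkable progress, we found that deep-learning-based traffic forecasting models still fail on certain patterns, mainly in event situations (e.g., rapid speed drops). These failures are commonly attributed to unpredictable noise, but we found that they can be corrected by taking previous failures into account. Specifically, we observe autocorrelated errors in these failures, which indicates that some predictable information remains. In this study, to capture the correlation of errors, we introduce ResCAL, a residual estimation module for traffic forecasting that serves as a widely applicable add-on to existing traffic forecasting models. ResCAL calibrates the predictions of existing models in real time by estimating future errors from previous errors and graph signals. Extensive experiments on METR-LA and PEMS-BAY demonstrate that ResCAL correctly captures the correlation of errors and corrects the failures of various traffic forecasting models in event situations.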

Reweighting Strategy based on Synthetic Data Identification for Sentence Similarity

Taehee Kim, ChaeHun Park, Jimin Hong, Radhika Dua, Edward Choi, Jaegul Choo

Category: Natural Language Processing

2022-08-29

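Semantically meaningful sentence embeddings are important for many tasks in natural language processing. To obtain such embeddings, recent studies have explored using data synthetically generated by pretrained language models (PLMs) as a training corpus. However, PLMs often generate sentences that differ considerably from those written by humans. We hypothesize that treating all of these synthetic examples equally when training deep neural networks can adversely affect the learning of semantically meaningful embeddings. To analyze this, we first train a classifier to identify machine-written sentences and observe that the linguistic features of sentences identified as machine-written differ significantly from those of human-written sentences. Based on this, we propose a novel approach that first trains the classifier to measure the importance of each sentence. The information distilled from the classifier is then used to train a reliable sentence embedding model. Through extensive evaluation on four real-world datasets, we demonstrate that our model trained on synthetic data generalizes well and outperforms existing baselines. Our implementation is publicly available at https://github.com/ddehun/coling2022_reweighting_sts.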
